Skip to content

Conversation

@Wennie396
Copy link
Contributor

@Wennie396 Wennie396 commented Sep 23, 2025

PR types

Others

PR changes

Others

Description

当export HACK_OFFLOAD_OPTIMIZER=1并且开启pp或mp的sync_param的时候,可以通过设置export FLAGS_offload_opt_cache_optstates=1减少一次master weight的offload和reload

@paddle-bot
Copy link

paddle-bot bot commented Sep 23, 2025

Thanks for your contribution!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant